90 research outputs found

    Mapping the strand-specific transcriptome of fission yeast

    Get PDF
    Pervasive genome-wide transcription is widespread in eukaryotic cells, but key features of the transcriptome have yet to be fully characterized. a new study using antibody-based detection of RNA-DNA duplexes on tiling arrays now reveals a complex, strand-specific transcriptional world in fission yeast

    Apparent non-canonical trans-splicing is generated by reverse transcriptase in vitro

    Get PDF
    Trans-splicing, the in vivo joining of two RNA molecules, is well characterized in several groups of simple organisms but was long thought absent from fungi, plants and mammals. However, recent bioinformatic analyses of expressed sequence tag (EST) databases suggested widespread trans-splicing in mammals^1-2^. Splicing, including the characterised trans-splicing systems, involves conserved sequences at the splice junctions. Our analysis of a yeast non-coding RNA revealed that around 30% of the products of reverse transcription lacked an internal region of 117 nt, suggesting that the RNA was spliced. The junction sequences lacked canonical splice-sites but were flanked by direct repeats, and further analyses indicated that the apparent splicing actually arose because reverse transcriptase can switch templates during transcription^3^. Many newly identified, apparently trans-spliced, RNAs lacked canonical splice sites but were flanked by short regions of homology, leading us to question their authenticity. Here we report that all reported categories of non-canonical splicing could be replicated using an in vitro reverse transcription system with highly purified RNA substrates. We observed the reproducible occurrence of ostensible trans-splicing, exon shuffling and sense-antisense fusions. The latter generate apparent antisense non-coding RNAs, which are also reported to be abundant in humans^4^. Different reverse transcriptases can generate different products of template switching, providing a simple diagnostic. Many reported examples of splicing in the absence of canonical splicing signals may be artefacts of cDNA preparation

    Alteration of Sequence Specificity of the Type IIS Restriction Endonuclease BtsI

    Get PDF
    The Type IIS restriction endonuclease BtsI recognizes and digests at GCAGTG(2/0). It comprises two subunits: BtsIA and BtsIB. The BtsIB subunit contains the recognition domain, one catalytic domain for bottom strand nicking and part of the catalytic domain for the top strand nicking. BtsIA has the rest of the catalytic domain that is responsible for the DNA top strand nicking. BtsIA alone has no activity unless it mixes with BtsIB to reconstitute the BtsI activity. During characterization of the enzyme, we identified a BtsIB mutant R119A found to have a different digestion pattern from the wild type BtsI. After characterization, we found that BtsIB(R119A) is a novel restriction enzyme with a previously unreported recognition sequence CAGTG(2/0), which is named as BtsI-1. Compared with wild type BtsI, BtsI-1 showed different relative activities in NEB restriction enzyme reaction buffers NEB1, NEB2, NEB3 and NEB4 and less star activity. Similar to the wild type BtsIB subunit, the BtsI-1 B subunit alone can act as a bottom nicking enzyme recognizing CAGTG(-/0). This is the first successful case of a specificity change among this restriction endonuclease type

    Defining genes: a computational framework

    Get PDF
    The precise elucidation of the gene concept has become the subject of intense discussion in light of results from several, large high-throughput surveys of transcriptomes and proteomes. In previous work, we proposed an approach for constructing gene concepts that combines genomic heritability with elements of function. Here, we introduce a definition of the gene within a computational framework of cellular interactions. The definition seeks to satisfy the practical requirements imposed by annotation, capture logical aspects of regulation, and encompass the evolutionary property of homology

    Beyond the Gene

    Get PDF
    This paper is a response to the increasing difficulty biologists find in agreeing upon a definition of the gene, and indeed, the increasing disarray in which that concept finds itself. After briefly reviewing these problems, we propose an alternative to both the concept and the word gene—an alternative that, like the gene, is intended to capture the essence of inheritance, but which is both richer and more expressive. It is also clearer in its separation of what the organism statically is (what it tangibly inherits) and what it dynamically does (its functionality and behavior). Our proposal of a genetic functor, or genitor, is a sweeping extension of the classical genotype/phenotype paradigm, yet it appears to be faithful to the findings of contemporary biology, encompassing many of the recently emerging—and surprisingly complex—links between structure and functionality

    Generation of a non-small cell lung cancer transcriptome microarray

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Non-small cell lung cancer (NSCLC) is the leading cause of cancer mortality worldwide. At present no reliable biomarkers are available to guide the management of this condition. Microarray technology may allow appropriate biomarkers to be identified but present platforms are lacking disease focus and are thus likely to miss potentially vital information contained in patient tissue samples.</p> <p>Methods</p> <p>A combination of large-scale in-house sequencing, gene expression profiling and public sequence and gene expression data mining were used to characterise the transcriptome of NSCLC and the data used to generate a disease-focused microarray – the Lung Cancer DSA research tool.</p> <p>Results</p> <p>Built on the Affymetrix GeneChip platform, the Lung Cancer DSA research tool allows for interrogation of ~60,000 transcripts relevant to Lung Cancer, tens of thousands of which are unavailable on leading commercial microarrays.</p> <p>Conclusion</p> <p>We have developed the first high-density disease specific transcriptome microarray. We present the array design process and the results of experiments carried out to demonstrate the array's utility. This approach serves as a template for the development of other disease transcriptome microarrays, including non-neoplastic diseases.</p

    A new real-time PCR method to overcome significant quantitative inaccuracy due to slight amplification inhibition

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Real-time PCR analysis is a sensitive DNA quantification technique that has recently gained considerable attention in biotechnology, microbiology and molecular diagnostics. Although, the cycle-threshold (<it>Ct</it>) method is the present "gold standard", it is far from being a standard assay. Uniform reaction efficiency among samples is the most important assumption of this method. Nevertheless, some authors have reported that it may not be correct and a slight PCR efficiency decrease of about 4% could result in an error of up to 400% using the <it>Ct </it>method. This reaction efficiency decrease may be caused by inhibiting agents used during nucleic acid extraction or copurified from the biological sample.</p> <p>We propose a new method (<it>Cy</it><sub><it>0</it></sub>) that does not require the assumption of equal reaction efficiency between unknowns and standard curve.</p> <p>Results</p> <p>The <it>Cy</it><sub><it>0 </it></sub>method is based on the fit of Richards' equation to real-time PCR data by nonlinear regression in order to obtain the best fit estimators of reaction parameters. Subsequently, these parameters were used to calculate the <it>Cy</it><sub><it>0 </it></sub>value that minimizes the dependence of its value on PCR kinetic.</p> <p>The <it>Ct</it>, second derivative (<it>Cp</it>), sigmoidal curve fitting method (<it>SCF</it>) and <it>Cy</it><sub><it>0 </it></sub>methods were compared using two criteria: precision and accuracy. Our results demonstrated that, in optimal amplification conditions, these four methods are equally precise and accurate. However, when PCR efficiency was slightly decreased, diluting amplification mix quantity or adding a biological inhibitor such as IgG, the <it>SCF</it>, <it>Ct </it>and <it>Cp </it>methods were markedly impaired while the <it>Cy</it><sub><it>0 </it></sub>method gave significantly more accurate and precise results.</p> <p>Conclusion</p> <p>Our results demonstrate that <it>Cy</it><sub><it>0 </it></sub>represents a significant improvement over the standard methods for obtaining a reliable and precise nucleic acid quantification even in sub-optimal amplification conditions overcoming the underestimation caused by the presence of some PCR inhibitors.</p

    Expression of Conjoined Genes: Another Mechanism for Gene Regulation in Eukaryotes

    Get PDF
    From the ENCODE project, it is realized that almost every base of the entire human genome is transcribed. One class of transcripts resulting from this arises from the conjoined gene, which is formed by combining the exons of two or more distinct (parent) genes lying on the same strand of a chromosome. Only a very limited number of such genes are known, and the definition and terminologies used for them are highly variable in the public databases. In this work, we have computationally identified and manually curated 751 conjoined genes (CGs) in the human genome that are supported by at least one mRNA or EST sequence available in the NCBI database. 353 representative CGs, of which 291 (82%) could be confirmed, were subjected to experimental validation using RT-PCR and sequencing methods. We speculate that these genes are arising out of novel functional requirements and are not merely artifacts of transcription, since more than 70% of them are conserved in other vertebrate genomes. The unique splicing patterns exhibited by CGs reveal their possible roles in protein evolution or gene regulation. Novel CGs, for which no transcript is available, could be identified in 80% of randomly selected potential CG forming regions, indicating that their formation is a routine process. Formation of CGs is not only limited to human, as we have also identified 270 CGs in mouse and 227 in drosophila using our approach. Additionally, we propose a novel mechanism for the formation of CGs. Finally, we developed a database, ConjoinG, which contains detailed information about all the CGs (800 in total) identified in the human genome. In summary, our findings reveal new insights about the functionality of CGs in terms of another possible mechanism for gene regulation and genomic evolution and the mechanism leading to their formation

    Spatial Proximity and Similarity of the Epigenetic State of Genome Domains

    Get PDF
    Recent studies demonstrate that the organization of the chromatin within the nuclear space might play a crucial role in the regulation of gene expression. The ongoing progress in determination of the 3D structure of the nuclear chromatin allows one to study correlations between spatial proximity of genome domains and their epigenetic state. We combined the data on three-dimensional architecture of the whole human genome with results of high-throughput studies of the chromatin functional state and observed that fragments of different chromosomes that are spatially close tend to have similar patterns of histone modifications, methylation state, DNAse sensitivity, expression level, and chromatin states in general. Moreover, clustering of genome regions by spatial proximity produced compact clusters characterized by the high level of histone modifications and DNAse sensitivity and low methylation level, and loose clusters with the opposite characteristics. We also associated the spatial proximity data with previously detected chimeric transcripts and the results of RNA-seq experiments and observed that the frequency of formation of chimeric transcripts from fragments of two different chromosomes is higher among spatially proximal genome domains. A fair fraction of these chimeric transcripts seems to arise post-transcriptionally via trans-splicing
    corecore